A Deep Survey of HI— Selected Galaxies: The HI Mass Function 
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ABSTRACT 



We analyze the distribution of galaxy HI masses detected in a large, deep 
HI survey conducted at the Arecibo observatory, and we find possible evidence 
of a faint-end steepening of the mass function similar to what has been found 
optically. This is the first HI survey with enough dynamic range to see this 
steepening; the results of an earlier survey are found to be consistent when the 



detection statistics are re-examined. We demonstrate a technique for testing 
and correcting source count completeness in HI surveys based on the V/V max 
test and the large scale structure in the regions surveyed. 
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Subject headings: galaxies: luminosity function, mass function — large-scale 
structure of universe — radio lines: galaxies 



^ ■ 1. Introduction 



Recent optical studies indicate that the faint end of the galaxy luminosity function may 
grow significantly steeper than the power-law slope seen at intermediate ranges (Marzke, 
Huchra, & Geller 1994a; Driver & Phillipps 1996; Loveday 1997). The rise appears to be 
present in both cluster and field galaxies, and is particularly strong among Magellanic 
spirals and irregulars (Marzke et al. 1994b). This is intriguing because if the faint-end slope 
is steep enough, a significant fraction of baryonic matter may be bound to small galaxies. 

Since optical counts of low-luminosity sources may have subtle selection effects, several 
groups have taken an independent approach, making "blind" searches for extragalactic 
21 cm HI emission (see Spitzak & Schneider 1998, Paper I, and references therein). HI 
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observations are advantageous for detecting low-luminosity systems because starlight is not 
needed to power 21 cm line emission, and HI is generally abundant in field dwarfs. 

Our HI survey in the "Arecibo Slice"0 (Paper I) detected 75 HI sources and is the first 
blind HI survey to detect sources over a range of masses comparable to the range of optical 
luminosities in samples in which the faint-end steepening is seen. One other published 
survey, the Arecibo HI Strip Survey or 'AHISS" (Sorar 1994; Zwaan et al. 1997, hereafter 
ZBSS), samples a comparable area of the sky, however it was carried out and analyzed in a 
different fashion. In our Arecibo Slice, closely spaced, pointed observations were made so 
that all sources were detected in the telescope's main beam, and the effective sensitivity 
over the survey area was relatively uniform. In the AHISS, driftscans were used and sources 
were detected all over the main beam and sidelobes of the telescope. 

The most important information needed for converting survey detections into a mass 
function is a thorough understanding of the survey's completeness. Both the Arecibo Slice 
and AHISS have now been followed up by confirmation observations that give accurate 
fluxes for the sources, but to determine the completeness we must understand the sensitivity 
to sources in their original detection scans. For this reason the sidelobe detections of 
the AHISS are problematic. The Arecibo sidelobes had asymmetries and temperature 
dependencies that made their sensitivity uncertain. Moreover, the original detection fluxes 
of AHISS sidelobe sources were not saved (Sorar 1998, private communication), so we 
cannot apply completeness tests after the fact. For these reasons, we will consider only the 
45 main-beam AHISS sources in the remainder of this paper. 

The problem with sidelobe source sensitivity is also the main reason why an analysis of 
the original results from the AHISS by Schneider (1997) gave different results from ZBSS, 
who attributed the difference to using the original Arecibo detection fluxes versus their new 
VLA measurements. This is unlikely to explain the difference since gain variations due to 
the uncertain positions should have introduced a shift and a scatter in the results that were 
smaller on average than the bin size used in the mass function. Instead, we believe the 
difference arises from ZBSS's inclusion of low-sensitivity sidelobe detections — 17 of 18 of 
these had high HI masses M HI > 1O 9 M . (Note that we use Galactocentric velocities and 
assume H = 75 km s _1 Mpc -1 .) Including these sources could easily bias the apparent 
slope of the HI mass function because of the uncertainties in sidelobe behavior. 

Rather than entering into further debate about details of observational technique, we 
propose to demonstrate and apply a technique for directly determining the completeness of 
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HI surveys. We show how to use an estimate of the density variations due to large scale 
structure (LSS) in the area of a survey to test and correct for survey sensitivity. Based on 
the available data, we show that the Arecibo HI surveys are consistent with a steepening 
at the faint end of the HI mass function, which is quite similar to the results of Loveday 
(1997) for optically-selected field galaxies. 

We first examine the LSS in the regions of the two HI surveys in §2. Using the LSS 
information, we determine the behavior of the sensitivity limits of the HI surveys in §3. 
Then in §4, we derive the mass function for the Arecibo surveys. We conclude with a 
summary and discussion of the results in §5. 



2. Large Scale Structure in the Survey Regions 

Using optical redshifts we can make LSS density estimates as a function of redshift over 
the areas surveyed for HI. These density estimates should be appropriate for our purposes 
since HI sources qualitatively trace the same structure as optically identified galaxies 
(Paper I). We use galaxies from the "RC3" (de Vaucouleurs et al. 1991) that are within 
5 = ±10° of the narrow declination bands examined in each HI survey. The RC3 sources 
with redshifts in these regions appear to be complete to a limiting magnitude of mu m < 14.5 
based on their number counts increasing like 10°- 6m . Since the AHISS strips cross through 
the Galactic plane the RC3 coverage of them is incomplete, but we shall show later that the 
results are consistent with the Arecibo Slice and the effect of LSS is small in any case. 

For a galaxy with apparent magnitude m^, the maximum redshift at which it would 
remain brighter than mi im is z maXji = ZjlO <mHm ~ m-i )/ 5 , where we assume the redshift Zi is 
directly proportional to distance. The number of galaxies we expect to observe with z < Zi 
but z max > Zi is. 

30(M t ) 

Ni = —Pi*) , (1) 

z i 

where p(zi) = J Zl p{z)z 2 dz is the mean relative density out to redshift Zj. The quantity 
0(Mj) represents an integral of the luminosity function, but since it divides out in the end, 
its form need not be known. 

We can write a recursive relation between the mean density for the galaxies out to Zi 
and out to the galaxy with the next smaller redshift Zi-\. 



( Ni - l)/z f 
Ni/zU 



Afc-i) = M - ' Z 1 J - • (2) 



We normalize p = 1 averaged over the full redshift range of the survey. (This normalization 
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Fig. 1. — Estimated density of galaxies as a function of redshift in the regions surrounding 
the HI surveys. The solid line is for the Arecibo Slice (Paper I), and the dashed line is for 
the AHISS. 

may affect the overall scaling of the mass function, but not its shape.) This gives us the 
mean relative density for all smaller redshifts which we differentiate to find the local relative 
density p(z) shown in Fig. [1| (To produce this plot the data were binned and smoothed 
with a Gaussian of 500 km s~ x FWHM.) Both surveys show an excess of galaxies around 
v = 5000 km s _1 caused by the Pisces-Perseus supercluster. 

The density of galaxies also rises at small redshifts in the Arecibo Slice due to the local 
supercluster. This nearby rise might exaggerate the counts of low mass sources since they 
are detectable only out to small velocities. However, we shall show in §5 that the magnitude 
of the effect is small. A more useful aspect of our LSS determination is that it permits us 
to test the surveys' completeness. 
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3. HI Survey Completeness 

To determine the number density of sources of a given HI mass, we must accurately 
establish the volume within which a survey is sensitive to them. This is complicated by 
several factors: (1) the same galaxy observed at different inclinations will have different line 
widths; (2) blind HI surveys detect objects with sensitivities that depend on distance from 
beam center and frequency; and (3) efforts to reject man-made interference and to subtract 
instrumental "baselines" can artificially suppress real signals. Since these HI surveys include 
only confirmed sources, we do not need to worry about the opposite problem of including 
unreliable sources, but we need some means of establishing the surveys' completeness. 

For a galaxy with a total signal S (integrating the HI flux density over the 21 cm line 
width w) the peak signal-to- noise ratio S/N is achieved when the spectrum is smoothed to a 
velocity resolution equal to w. In this ideal case, S/N oc w~ 1 ^ 2 (see, for example, Schneider 
1997; ZBSS). In Paper I we found our minimum detected fluxes were at a signal-to- noise 
S/N ^ 5 based on this description. 

The noise level in the AHISS spectra varied substantially in different regions depending 
on the total number of drift-scan spectra, which were collected once per day and then 
averaged. When we use the noise value for each source (based on the number of observations 
averaged in its vicinity according to Sorar 1994) we find a lower limit of S/N ^ 7. This 
higher limit probably reflects the fact that Sorar (1994) confirmed whether sources were 
detected "by inspecting the data for each day separately." Sources close to his quoted 5a 
search limit would be only 1— 2a in a single day's data and very difficult to confirm. 

A direct test of whether a survey is complete to its quoted sensitivity limit is to 
compute V/V m ax = (z/z max ) 3 . This should average to 1/2 if z max is correctly determined 
(Schmidt 1968). In this test we set z max to the smaller of the bandpass limit and the highest 
detectable redshift based on the claimed detection limit. We find V/V max averages to 0.39 
for the Arecibo Slice using our quoted 5a limit. We find an even lower average, 0.34, for the 
AHISS sources using their quoted 5a limit, or 0.41 using our estimated 7a limit. 

To determine whether LSS could affect V/V ma x, we examine its dependence on redshift. 
The heavy line in Fig. |2| shows the expected behavior as a function of redshift for the density 
distribution derived in §2. (At each z max we consider all sources i with Z{ < z max < z max>i .) 
The dotted lines show the observed behavior of V /V ma x based on the suggested detection 
limits of the two HI surveys. The values are systematically below 1/2 even at redshifts 
where LSS should make them high. 

This should not be surprising, since a detection limit is normally lower than a 
completeness limit. We can force the average value of V /V ma x to be m 1/2 by assuming 
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Fig. 2. — Expected and observed values of V/V max as a function of z max for (a) the Arecibo 
Slice, and (b) the AHISS. The heavy solid line shows the expected distribution derived from 
the density structure in Fig. |I[ The dotted and dashed lines show the distribution determined 
from the HI data assuming 5 and 7— a completeness limits (7 and 10-cr for the AHISS). The 
thin solid line shows the results for an empirical model where the detection depends on the 
line width to the 0.75 power (see text). 
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the S/N must be 40% higher for the completeness limit (dashed lines), but the result is 
systematically too high at low redshifts. A better explanation is that the completeness limit 
does not actually depend on a source's line width like w - 5 as assumed above. We noted 
indications of this in Paper I — narrow-line sources were detected to lower S/N. We have 
experimented with sensitivities that depend on w x , defining a new "signal-to-noise" ratio: 

(300km s^/wresy-QSfSdv 
[w/w res ) x x rms x w res 

where rms is the noise measured at the spectral resolution w res , J S dv is the integrated flux 
in the line, and the result is normalized to match S/N at w = 300 km s _1 . 

A good empirical fit for the completeness limit is found with x = 0.75, which suggests 
that a minimum flux density also plays a role in detection. For x = 0.75 and S/N new = 7 
(S/N new = 10 for AHISS detections), we get the thin solid-line curves in Fig. |], which match 
the expected V/V max behavior quite well. By inverting eqn. [J[ these values also define the 
"completeness flux" for a source, (/ S dv) comp , for which the survey is on average complete. 



4. The HI Mass Function 

Equipped with knowledge of LSS and survey completeness, we can derive the HI mass 
function. We do this by means of the Vtot method. ZBSS refer to this as the Hl/V max 
method, but we want to avoid confusion with V max as defined in §3. Vtot describes the total 
"completeness volume" in which a source should on average have been detected over the 
entire survey area, taking into account offsets from beam center, etc.; V m ax describes the 
maximum distance a source should have been detected in its actual detection spectrum. 
The completeness volume of each source implies an overall density of sources: 

$(M 1 <M<M 2 )= 1 /V to *(M i ) (4) 

Mi<Mi<M2 

for sources in the mass range M\-Mi. This method depends critically on an accurate 
determination of the completeness volume, which we discuss first, and is also influenced by 
LSS, which we address subsequently. 

We empirically determined the decline in sensitivity to sources offset from the 
beam center by using the ratio of detection fluxes to true fluxes (found from follow-up 
observations). Because of the hexagonal sampling pattern in the Arecibo Slice, sources were 
never farther than about 2.3' from beam center. We limit our analysis of AHISS sources to 
those within 3' of beam center. 
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The frequency dependence of Arecibo's line feeds changes the gain by up to a factor 
of ~ 2 over the observed redshift ranges. The AHISS 5 = 23° strip presents a problem 
here. The data were collected with the feed tuned to different frequencies on different days, 
making the sensitivity up to 2 x larger or smaller at different points in the bandpass, and 
the detections were carried out on subsets of the data with different frequency dependences. 
Too few details of this were recorded to allow us to reconstruct the exact detection 
characteristics, so we treat the 5 = 23° strip like the 5 = 14° strip, for which the frequency 
dependence remained fixed. 

We determine Vtot f° r each source in the Arecibo Slice and AHISS by numerically 
integrating the search volume over all possible positions within the survey boundaries. We 
include the frequency dependence across the bandpass to determine at which redshifts a 
source's observed flux should be detectable. Finally, the differing AHISS rms noise levels 
were also included in the calculation of the total completeness volume. 

Some of the lowest flux (not necessarily low mass) sources are fainter than the 
completeness flux, (/ Sdv) comp , estimated above. However, this does not affect the shape 
of the mass function at the faint end since Vtot changes by the same factor for all but 
the highest mass sources 10 9 M©). We checked this conclusion by using Monte Carlo 
simulations of sources following a Schechter function (a = —1.1 to —1.5), selecting sources 
statistically to imitate the declining completeness seen in the HI surveys. We found that 
application of the V/V max test and V to t method recovered the input density and slope. 

Another concern in determining the mass function is the effect of large scale structure. 
ZBSS have shown that the Vtot method is fairly insensitive to sinusoidal variations in LSS, 
but it is important to determine whether a local overdensity could have a significant effect 
on the source counts at the low-mass end. 

Our test of the possible significance of LSS is to use the optically-based density 
structure determined in §2. Each source's value of V to t is corrected to account for the 
probability of finding a source within its completeness volume. This is done by dividing Vtot 
by the mean density out to the source's maximum detectable redshift: p(z max ). Since this 
depends on beam offset and the other sensitivity factors, we calculate the corrected volume 
in our numerical integrations of the completeness volume described above. 

The joint mass function for the Arecibo Slice and AHISS is shown by solid circles with 
Poisson 95% confidence limits (Gehrels 1986) in Fig. |3|. To calculate the joint mass function 
requires that we determine each source's completeness volume in the other survey. This 
is straightforward since line widths and source fluxes were measured similarly in the two 
surveys, giving us a sample of 120 HI sources. 
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Fig. 3. — The HI mass function determined from Arecibo surveys of HI sources. Solid circles 
with 95% confidence limits show the joint mass function of the Arecibo Slice and AHISS. 
Several subsets and alternative reductions of the data are also shown without error bars, 
and are offset right and left of the joint mass function points for clarity. Open circles show 
the mass function of the Arecibo Slice alone; open squares, AHISS; open triangles, AHISS 
restricted to the 5 = 14° strip. Solid squares show the mass function derived when no 
corrections are made for large scale structure. The dotted curve shows a Schechter function 
with a = —1.2 suggested by ZBSS, and the dashed curve is the functional form of the 
faint-end turn-up found by Loveday (1997) for optical sources. 
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The LSS turns out to have only a minor effect on the resulting mass function. Solid 
squares show the mass function we would have determined directly from the Vtot method 
with no LSS corrections. (Note that the points are offset to the left for clarity.) There is 
a small adjustment at low-masses, but the local overdensity would have to be many times 
larger for this to have generated the higher value we see. 

The results from the Arecibo Slice alone are shown by open circles in the figure; from 
AHISS, by open squares. We also show results for the 5 = 14° strip of AHISS (open 
triangles) alone, which did not suffer from the problems with the 5 = 23° strip described 
earlier. The 5 = 14° results generally agree better with the Arecibo Slice results. 

5. Summary and Discussion 

The present combination of the Arecibo Slice and Arecibo HI Strip Survey yields the 
largest and deepest sample of Hl-selected sources studied to date. The HI mass function 
derived from these surveys is suggestive of a turn-up at the low-mass end of the HI mass 
function similar to that seen at the faint end of the optical luminosity function. In Fig. |3] 
we show the empirical luminosity function found by Loveday (1997) scaled to match the 
turnover at high HI masses. The turn-up we see at low HI masses is at about the same 
relative location Loveday found optically. The rise at the faint end of the HI mass function 
appears to be in good agreement with the optical results of Marzke et al. (1994b), who 
found that the rise results primarily from late-type galaxies. 

ZBSS have argued that the HI mass function is consistent with a relatively shallow- 
sloped rise like the a = —1.2 Schechter function shown by a dotted curve in Fig. |3|. We find 
an apparent rise in the lowest 10 7 M© bin, in which two HI sources were detected in the 
Arecibo Slice. The AHISS detected no sources with such low masses, but we would have 
expected only 1-2 sources (< 1 source in the 5 = 14° strip), based on the relative volume 
sensitivities. Obviously these are small number statistics, but the 95% Poisson confidence 
limit excludes the Schechter fit proposed by ZBSS. 

The differences we find in this paper result from the greater success of the Arecibo 
Slice project in detecting low-mass HI sources, the rejection of sidelobe sources from the 
AHISS, and the completeness tests we have applied. We believe that many previous surveys 
have overestimated their sensitivity and hence their completeness volumes (Vtot), and 
therefore underestimated the counts of low mass sources. The rise at the low-mass end of 
the HI mass function is still a relatively weak statistical result, but we have shown that 
it cannot be attributed to two potential observational issues. (1) Narrow-line sources are 
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easier to detect than wide-line sources at the same statistical S/N. We have determined 
that this is described by an effective U S/N" oc w 0J5 and adjust our results accordingly. (2) 
The overdensity of sources in the Local Supercluster can also exaggerate low-mass counts, 
but we show this effect to be minimal, at least if HI sources are no more clustered than 
optically-selected sources. 

The volume of space so-far surveyed at high enough sensitivity to detect low Hi-mass 
sources is still small, so large new surveys currently planned or ongoing should definitively 
establish the shape of HI mass function. However, the results will remain only as accurate 
as the determination of the surveys' actual completeness limits. We have demonstrated in 
this paper how to use information about the LSS and a V/V max test to directly determine 
an appropriate sensitivity limit, and we recommend that other HI surveys test their 
completeness similarly. 
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